Conference Proceedings

E-Storm: Replication-based State Management in Distributed Stream Processing Systems

Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin Rubinstein, Rajkumar Buyya

Proceedings of the International Conference on Parallel Processing | IEEE COMPUTER SOC | Published : 2017

Abstract

Apache Storm is a fault-tolerant, distributed inmemory computation system for processing large volumes of high-velocity data in real-time. As an integral part of the fault-tolerance mechanism, Storm's state management is achieved by a checkpointing framework, which commits states regularly and recovers lost states from the latest checkpoint. However, this method involves a remote data store for state preservation and access, resulting in significant overheads to the performance of error-free execution.In this paper, we propose E-Storm, a replication-based state management system that actively maintains multiple state backups on different worker nodes. We build a prototype on top of Storm by ..

View full abstract

Grants

Awarded by Australian Research Council Future Fellowship


Funding Acknowledgements

The authors would like to thank Chenhao Qu and other members of the CLOUDS Lab at the University of Melbourne for their valuable comments towards improving the quality of the paper. This work is supported by Australian Research Council Future Fellowship and DP150103710.